Search CORE

189 research outputs found

Process-algebraic modelling of priority queueing networks

Author: Casale Giuliano
Tribastone Mirco
Publication venue
Publication date: 01/09/2010
Field of study

We consider a closed multiclass queueing network model in which each class receives a different priority level and jobs with lower priority are served only if there are no higher-priority jobs in the queue. Such systems do not enjoy a product form solution, thus their analysis is typically carried out through approximate mean value analysis (AMVA) techniques. We formalise the problem in PEPA in a way amenable to differential analysis. Experimental results show that our approach is competitive with simulation and AMVA methods

IMT Institutional Repository

Anomaly Detection for Big Data Technologies

Author: Alnafessah Ahmad
Casale Giuliano
Publication venue: OASIcs - OpenAccess Series in Informatics. 2018 Imperial College Computing Student Workshop (ICCSW 2018)
Publication date: 01/01/2019
Field of study

The main goal of this research is to contribute to automated performance anomaly detection for large-scale and complex distributed systems, especially for Big Data applications within cloud computing. The main points that we will investigate are: - Automated detection of anomalous performance behaviors by finding the relevant performance metrics with which to characterize behavior of systems. - Performance anomaly localization: To pinpoint the cause of a performance anomaly due to internal or external faults. - Investigation of the possibility of anomaly prediction. Failure prediction aims to determine the possible occurrences of catastrophic events in the near future and will enable system developers to utilize effective monitoring solutions to guarantee system availability. - Assessment for the potential of hybrid methods that combine machine learning with traditional methods used in performance for anomaly detection. The topic of this research proposal will offer me the opportunity to more deeply apply my interest in the field of performance anomaly detection and prediction by investigating and using novel optimization strategies. In addition, this research provides a very interesting case of utilizing the anomaly detection techniques in a large-scale Big Data and cloud computing environment. Among the various Big Data technologies, in-memory processing technology like Apache Spark has become widely adopted by industries as result of its speed, generality, ease of use, and compatibility with other Big Data systems. Although Spark is developing gradually, currently there are still shortages in comprehensive performance analyses that specifically build for Spark and are used to detect performance anomalies. Therefore, this raises my interest in addressing this challenge by investigating new hybrid learning techniques for anomaly detection in large-scale and complex systems, especially for in-memory processing Big Data platforms within cloud computing

Dagstuhl Research Online Publication Server

Process-Algebraic Modelling of Priority Queueing Networks

Author: Giuliano Casale
Mirco Tribastone
Publication venue
Publication date: 03/04/2020
Field of study

CiteSeerX

SampleHST: Efficient On-the-Fly Selection of Distributed Traces

Author: Casale Giuliano
Casale Giuliano
Gao Yicheng
Gao Yicheng
O’Brien Owen
O’Brien Owen
Perusquía José A.
Perusquía José A.
Sheldon Matthew
Sheldon Matthew
Ul Gias Alim
Ul Gias Alim
Publication venue: IEEE
Publication date: 01/01/2023
Field of study

Since only a small number of traces generated from distributed tracing helps in troubleshooting, its storage requirement can be significantly reduced by biasing the selection towards anomalous traces. To aid in this scenario, we propose SampleHST, a novel approach to sample on-the-fly from a stream of traces in an unsupervised manner. SampleHST adjusts the storage quota of normal and anomalous traces depending on the size of its budget. Initially, it utilizes a forest of Half Space Trees (HSTs) for trace scoring. This is based on the distribution of the mass scores across the trees, which characterizes the probability of observing different traces. The mass distribution from HSTs is subsequently used to cluster the traces online leveraging a variant of the mean-shift algorithm. This trace-cluster association eventually drives the sampling decision. We have compared the performance of SampleHST with a recently suggested method using data from a cloud data center and demonstrated that SampleHST improves sampling performance up to by 9.5×

WestminsterResearch

SampleHST: Efficient On-the-Fly Selection of Distributed Traces

Author: Casale Giuliano
Gao Yicheng
Gias Alim Ul
O'Brien Owen
Perusquía José A.
Sheldon Matthew
Publication venue
Publication date: 09/09/2022
Field of study

arXiv.org e-Print Archive